HRs from survival and other time-to-event data are used extensively as safety and efficacy outcomes
of clinical trials, as well as in large-scale epidemiological studies. Depending on how the output is
formatted, it may show the HR for each predictor in a separate column in the regression table, or it
may create a separate table just for the HRs and their confidence intervals (CIs).
If the software doesn’t output HRs or their CIs, you can calculate them from the regression
coefficients and standard errors (SEs) as follows:
Hazard ratio
Lower 95 percent confidence limit
Upper 95 percent confidence limit
In Figure 23-4, the coefficients are listed under coef, and the SEs are listed under se(coef). HRs are
useful and meaningful measures of the extent to which a variable influences survival.
A HR of 1 corresponds to a regression coefficient of 0, and indicates that the variable has no effect
on survival.
The CI around the HR estimated from your sample indicates the range in which the true HR of the
population from which your sample was drawn probably lies.
In Figure 23-4, the HR for CenterCD is
, with a 95 percent CI of 1.29 to 1.92. This
means that an increase of 1 in CenterCD (meaning being a participant at Centers A or B compared to
being one at Centers C or D) is statistically significantly associated with a 57 percent increase in
hazard. This is because multiplying by 1.57 is equivalent to a 57 percent increase. Similarly, the HR
for Radiation (relative to the comparison, which is chemotherapy) is 0.649, with a 95 percent CI of
0.43 to 0.98. This means that those undergoing radiation had only 65 percent the hazard of those
undergoing chemotherapy, and the relationship is statistically significant.
Risk factors, or predictors associated with increased risk of the outcome, have HRs greater
than 1. Protective factors, or predictors associated with decreased risk of the outcome, have HRs
less than 1. In the example, CenterCD is a risk factor, and Radiation is a protective factor.
Assessing goodness-of-fit and predictive ability of the model
There are several measures of how well a regression model fits the survival data. These measures can
be useful when you’re choosing among several different models:
Should you include a possible predictor variable (like age) in the model?
Should you include the squares or cubes of predictor variables in the model (meaning including
age2 or age3 in addition to age)?
Should you include a term for the interaction between two predictors?